RAW SEQUENCE LISTING DATE: 09/18/2001 

PATENT APPLICATION: US/09/941,450 TIME: 12:46:55

Input Set : A:\seqlist.txt 
Output Set: N:\CRF3\09182001\I941450.raw 

3 <110> APPLICANT: Case, Casey C. 

4 Urnov, Fyodor 

6 <120> TITLE OF INVENTION: GENE IDENTIFICATION

8 <130> FILE REFERENCE: S7.US3 / 8325-0007.20

C--> 10 <140> CURRENT APPLICATION NUMBER: US/09/941,450
C--> 11 <141> CURRENT FILING DATE: 2001-08-28 

13 <150> PRIOR APPLICATION NUMBER: 09/395,448 

14 <151> PRIOR FILING DATE: 1999-09-14 
16 <160> NUMBER OF SEQ ID NOS: 23

18 <170> SOFTWARE: PatentIn Ver. 2.1

20 <210> SEQ ID NO: 1

21 <211> LENGTH: 25 

22 <212> TYPE: PRT 

23 <213> ORGANISM: Artificial Sequence 

25 <220> FEATURE: 

26 <223> OTHER INFORMATION: Description of Artificial Sequence: exemplary motif

27 of C2H2 class of zinc finger proteins (ZFP)

29 <220> FEATURE: 

30 <221> NAME/KEY: MOD_RES 

31 <222> LOCATION: (2)..(3)

32 <223> OTHER INFORMATION: Xaa = any amino acid 

34 <220> FEATURE: 

35 <221> NAME/KEY: MOD_RES 

36 <222> LOCATION: (4)..(5)

37 <223> OTHER INFORMATION: Xaa = any amino acid, may be present or absent 
39 <220> FEATURE: 

40 <221> NAME/KEY: MOD_RES

41 <222> LOCATION: (7)..(18)

42 <223> OTHER INFORMATION: Xaa = any amino acid
44 <220> FEATURE: 

45 <221> NAME/KEY: MOD_RES

46 <222> LOCATION: (20)..(22)

47 <223> OTHER INFORMATION: Xaa = any amino acid 

49 <220> FEATURE: 

50 <221> NAME/KEY: MOD_RES 

51 <222> LOCATION: (23)..(24)

52 <223> OTHER INFORMATION: Xaa = any amino acid, may be present or absent
54 <400> SEQUENCE: 1

W--> 55 Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
56 1 5 10 15

W--> 58 Xaa Xaa His Xaa Xaa Xaa Xaa Xaa His
59 20 25

62 <210> SEQ ID NO: 2 

63 <211> LENGTH: 10 

64 <212> TYPE: DNA 

65 <213> ORGANISM: Artificial Sequence 
67 <220> FEATURE:

68 <223> OTHER INFORMATION: Description of Artificial Sequence: ZFP target site

69 with two overlapping D-able subsites 

71 <220> FEATURE: 

72 <221> NAME/KEY: modified_base

73 <222> LOCATION: (1)..(2)

74 <223> OTHER INFORMATION: n = g, a, c or t

76 <220> FEATURE: 

77 <221> NAME/KEY: modified_base

78 <222> LOCATION: (5)

79 <223> OTHER INFORMATION: n = g, a, c or t

81 <220> FEATURE: 

82 <221> NAME/KEY: modified_base

83 <222> LOCATION: (8)

84 <223> OTHER INFORMATION: n = g, a, c or t

86 <220> FEATURE:

87 <221> NAME/KEY: modified_base

88 <222> LOCATION: (9) 

89 <223> OTHER INFORMATION: n = a, c or t; if g, then position 10 cannot be g 

90 or t 

92 <400> SEQUENCE: 2

93 nngkngknnn 10

96 <210> SEQ ID NO: 3 

97 <211> LENGTH: 10 

98 <212> TYPE: DNA 

99 <213> ORGANISM: Artificial Sequence 

101 <220> FEATURE: 

102 <223> OTHER INFORMATION: Description of Artificial Sequence: ZFP target site

103 with three overlapping D-able subsites

105 <220> FEATURE:

106 <221> NAME/KEY: modified_base

107 <222> LOCATION: (1)..(2) 

108 <223> OTHER INFORMATION: n = g, a, c or t

110 <220> FEATURE: 

111 <221> NAME/KEY: modified_base

112 <222> LOCATION: (5) 

113 <223> OTHER INFORMATION: n = g, a, c or t 

115 <220> FEATURE: 

116 <221> NAME/KEY: modified_base

117 <222> LOCATION: (8) 

118 <223> OTHER INFORMATION: n = g, a, c or t 



120 <400> SEQUENCE: 3
W--> 121 nngkngkngk 10



124 <210> SEQ ID NO: 4 

125 <211> LENGTH: 5 

126 <212> TYPE: PRT 

127 <213> ORGANISM: Artificial Sequence 

129 <220> FEATURE: 

130 <223> OTHER INFORMATION: Description of Artificial Sequence: linker
132 <400> SEQUENCE: 4 

133 Asp Gly Gly Gly Ser 

134 1 5 

137 <210> SEQ ID NO: 5 

138 <211> LENGTH: 5 

139 <212> TYPE: PRT 

140 <213> ORGANISM: Artificial Sequence 

142 <220> FEATURE: 

143 <223> OTHER INFORMATION: Description of Artificial Sequence: linker
145 <400> SEQUENCE: 5
146 Thr Gly Glu Lys Pro

147 1 5 

150 <210> SEQ ID NO: 6 

151 <211> LENGTH: 9 

152 <212> TYPE: PRT 

153 <213> ORGANISM: Artificial Sequence 

155 <220> FEATURE: 

156 <223> OTHER INFORMATION: Description of Artificial Sequence: linker

158 <400> SEQUENCE: 6

159 Leu Arg Gln Lys Asp Gly Glu Arg Pro

160 1 5 

163 <210> SEQ ID NO: 7 

164 <211> LENGTH: 4 

165 <212> TYPE: PRT 

166 <213> ORGANISM: Artificial Sequence 
168 <220> FEATURE:

169 <223> OTHER INFORMATION: Description of Artificial Sequence: linker

171 <400> SEQUENCE: 7

172 Gly Gly Arg Arg 

173 1 

176 <210> SEQ ID NO: 8 

177 <211> LENGTH: 5 

178 <212> TYPE: PRT 

179 <213> ORGANISM: Artificial Sequence 

181 <220> FEATURE: 

182 <223> OTHER INFORMATION: Description of Artificial Sequence: linker

184 <400> SEQUENCE: 8

185 Gly Gly Gly Gly Ser 

186 1 5 

189 <210> SEQ ID NO: 9 

190 <211> LENGTH: 8 

191 <212> TYPE: PRT 

192 <213> ORGANISM: Artificial Sequence 

194 <220> FEATURE: 

195 <223> OTHER INFORMATION: Description of Artificial Sequence: linker

197 <400> SEQUENCE: 9

198 Gly Gly Arg Arg Gly Gly Gly Ser 

199 1 5 

202 <210> SEQ ID NO: 10 

203 <211> LENGTH: 9 

204 <212> TYPE: PRT 

205 <213> ORGANISM: Artificial Sequence 

207 <220> FEATURE: 

208 <223> OTHER INFORMATION: Description of Artificial Sequence: linker

210 <400> SEQUENCE: 10

211 Leu Arg Gln Arg Asp Gly Glu Arg Pro

212 1 5 

215 <210> SEQ ID NO: 11 

216 <211> LENGTH: 12 

217 <212> TYPE: PRT 

218 <213> ORGANISM: Artificial Sequence 

220 <220> FEATURE: 

221 <223> OTHER INFORMATION: Description of Artificial Sequence: linker

223 <400> SEQUENCE: 11

224 Leu Arg Gln Lys Asp Gly Gly Gly Ser Glu Arg Pro

225 1 5 10 

228 <210> SEQ ID NO: 12 

229 <211> LENGTH: 16 

230 <212> TYPE: PRT 

231 <213> ORGANISM: Artificial Sequence 

233 <220> FEATURE:

234 <223> OTHER INFORMATION: Description of Artificial Sequence: linker
236 <400> SEQUENCE: 12

237 Leu Arg Gln Lys Asp Gly Gly Gly Ser Gly Gly Gly Ser Glu Arg Pro
238 1 5 10 15

241 <210> SEQ ID NO: 13 

242 <211> LENGTH: 97 

243 <212> TYPE: PRT 

244 <213> ORGANISM: Artificial Sequence 
246 <220> FEATURE:

247 <223> OTHER INFORMATION: Description of Artificial Sequence: ZFP sequence in

248 control construct
250 <400> SEQUENCE: 13 



251 Val Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly Cys Gly
252 1 5 10 15

254 Lys Val Tyr Gly Gly His Asp Thr Val Val Gly His Leu Arg Trp His
255 20 25 30

257 Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly Lys Arg
258 35 40 45

260 Phe Thr Ala Ala Asp Glu Val Gly Leu His Lys Arg Thr His Thr Gly
261 50 55 60

263 Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met Leu Val
264 65 70 75 80

266 Val Ala Thr Gln Leu His Ile Lys Thr His Gln Asn Lys Lys Gly Gly
267 85 90 95

269 Ser

273 <210> SEQ ID NO: 14 

274 <211> LENGTH: 292 

275 <212> TYPE: DNA 

276 <213> ORGANISM: Artificial Sequence 

278 <220> FEATURE: 

279 <223> OTHER INFORMATION: Description of Artificial Sequence: designed ZFP

280 construct (from KpnI to BamHI) targeting 9-base

281 pair target site in VEGF promoter
283 <220> FEATURE:

284 <221> NAME/KEY: CDS

285 <222> LOCATION: (2)..(292)

287 <400> SEQUENCE: 14 

288 g gta ccg ggc aag aag aag cag cac atc tgc cac atc cag ggc tgt ggt 49

289 Val Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly Cys Gly
290 1 5 10 15

292 aaa gtt tac ggc cgc tcc gac aac ctg acc cgc cac ctg cgc tgg cac 97
293 Lys Val Tyr Gly Arg Ser Asp Asn Leu Thr Arg His Leu Arg Trp His
294 20 25 30

296 acc ggc gag agg cct ttc atg tgt aca tgg tcc tac tgt ggt aaa cgc 145

297 Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly Lys Arg
298 35 40 45

300 ttc acc aac cgc gac acc ctg gcc cgc cac aag cgt acc cac acc ggt 193

301 Phe Thr Asn Arg Asp Thr Leu Ala Arg His Lys Arg Thr His Thr Gly
302 50 55 60

304 gag aag aaa ttt gct tgt ccg gaa tgt ccg aag cgc ttc atg cgc tcc 241

305 Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met Arg Ser

306 65 70 75 80

308 gac cac ctg tcc aag cac atc aag acc cac cag aac aag aag ggt gga 289

309 Asp His Leu Ser Lys His Ile Lys Thr His Gln Asn Lys Lys Gly Gly

310 85 90 95

312 tcc 292

313 Ser

316 <210> SEQ ID NO: 15

317 <211> LENGTH: 97

318 <212> TYPE: PRT

319 <213> ORGANISM: Artificial Sequence

20 <220> FEATURE

320 <223> OTHER INFORMATION: Description of Artificial Sequence: designed ZFP

324 <400> SEQUENCE: 15

325 Val Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly Cys Gly

326 1 5 10 15

328 Lys Val Tyr Gly Arg Ser Asp Asn Leu Thr Arg His Leu Arg Trp His 

329 20 25 30 

331 Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly Lys Arg 

332 35 40 45 

334 Phe Thr Asn Arg Asp Thr Leu Ala Arg His Lys Arg Thr His Thr Gly 

335 50 55 60 

337 Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met Arg Ser 

338 65 70 75 80 
340 Asp His Leu Ser Lys His Ile Lys Thr His Gln Asn Lys Lys Gly Gly
341 85 90 95 

343 Ser 
VERIFICATION SUMMARY

PATENT APPLICATION: US/09/941,450 



DATE: 09/18/2001 
TIME: 12:46:56 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\09182001\I941450.raw 



L:10 M:270 C: Current Application Number differs, Replaced Application Number 

L:11 M:271 C: Current Filing Date differs, Replaced Current Filing Date

L:55 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1

L:58 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1

L:93 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2

L:121 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3

L:320 M:258 W: Mandatory Feature missing, <220> FEATURE: 